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ABSTRACT 


"HThis  paper  reviews  and  applies  certain  results  concerning  the  matrix 
pseudoinverse  to  the  general  theory  of  estimable  functions  and  minimal 
variance  estimates.  The  paper  is  divided  into  two  sections.  The  first 
section  reviews  and  extends  certain  known  results  concerning  the  matrix 
pseudoinverse.  This  section  is  essentially  nonstatistical.  The  second 
section  uses  results  in  the  first  section  to  state  and  prove  a  generalized 
version  of  the  Gauss -Markoff  Theorem  (Reference  4^  page  14)  concerning 
unbiased  linear  estimates  having  minimal  variance.  In  the  third  section, 
an  additional  theorem  is  proven,  which  together  with  the  preceding  material 
provides  a  theoretical  foundation  for  parameter,  estimation  in  orbit  deter¬ 
mination  work.  This  foundation  is  then  exploited  to  provide  formulae  for 


such  parameters. 
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I.  •  PROPERTIES  OF  THE  MATRIX  PSEUDOINVERSE 

All  matrices  will  be  designated  by  capital  English  letters;  the  notation 

A(mxn)  =  (a..)  means  that  A  is  an  mxn  matrix  having  a.,  as  the  element 
th  th 

in  the  i  row  and  j  column  (i  =  1,  .  .  .  ,  m  =  number  of  rows;  j  =  I,  .  .  .  ,  n  =  number 
of  columns).  All  matrices  considered  will  be  presumed  to  have  elements  a_ 
which  are  real  numbers.  A*  designates  the  transpose  of  A;  thus  if  A(mxn)  = 
(a^.)  then  A*(nxm)  =  (b^.)  where  b^.  =  a.^.  I^  is  the  (nxn)  identity  matrix 
with  I's  down  the  diagonal  and  zeros  elsewhere;  visually  the  subscript  n  will 
be  dropped  if  the  dimension  of  I  is  clear  from  the  context,  e”^  is  the  (nxn) 
matrix  (Sj^j)  such  that  e^^  =  1  and  all  other  e^^  =  0;  usually  the  superscript 
n  will  be  dropped  if  the  dimension  of  E^^  is  clear  from  the  context.  A  (mxn) 
is  called  square  if  m  =  n.  If  A  is  square  then  |a|  designates  the  determin¬ 
ant  of  A.  If  A  is  square  and  |a|  ^  0  then  A  ^  designates  the  inverse  of  A, 
and  A  is  said  to  be  nonsingular.  6^  designates  the  set  of  all  real  (nxl) 
matrices;  £  will  also  be  called  Euclidean  n- space,  and  elements  of  £ 
called  n  dimensional  vectors,  or  just  vectors  if  the  dimension  is  clear  from 
the  context.  The  symbol  0  will  be  used  to  designate  either  a  matrix  which  is 
identically  zero  or  the  scalar  real  number  zero,  depending  upon  the  context. 
Given  any  x  in  £^,  ljx||  *  V  x*  x  and  is  called  the  "norm"  of  x.  The  symbol 

will  sometimes  be  used  for  "in"  in  the  set  theoretic  sense;  e.  g.  ,  "given 
any  x  €  £^"  means  given  any  x  which  is  an  element  (a  member)  of  the  set  £^. 

Let  W  (mxm)  be  positive  definite  so  that  W  =R’''  R  for  some  square  R, 

IrIVO,  R  =  R’''.  For  any  ae£  ,  define  ||a||^  =  a* R*  Ra  =  ||Ra|| 

Theorem  1:  For  every  (mxn)  matrix  A,  there  exists  a  unique  (nxm) 
matrix,  which  we  shall  designate  as  A^,  that  satisfies  the  following  four 
identities; 


The  bulk  of  the  material  in  Section  I  constitutes  a  review  and  restatement  of 
results  to  be  found  in  (1)  -  (3)  convenient  for  present  purposes.  In  general, 
the  absence  of  a  proof  following  the  statement  of  a  result  indicates  that  the 
result  and  proof  may  be  found  in  (1)  -  (3),  and  vice  versa. 
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(1)  AA'*’  A  =  A 

(2)  A+AA+  =  A+ 

(3)  {AA'^)"  =  AA+(mxm) 

(4)  (A+A)*  =  A+ A{nxn) 

Furthermore, 

(Rl)  if  D  =  is  square  (m  =n)  and  diagonal  (d^^  =  0  for  i  j) 

then  D"*"  =  (d+)  is  defined  by  dt.  =  0  for  i  ^  j,  d.+  =0  if  d..  =0,  dt.  =  d.’.^  if 

'  ij'  11  11  11 

d..  0. 

11 

(R2)  if  A*  A  =  PDP*,  where  PP*  =  P*P  =  I,  and  D  is  diagonal, 
then  A+  =  PD+P*a"". 


(R3)  if  A  =  BC,  where  the  columns  of  B  are  linearly  independent 
and  the  rows  of  C  are  linearly  independent,  then  A"*"  =  C  (C  C  )  (B  B)  B  , 
Thus 

(R3.  1)  A'*'  =  (A'^A)  ^  a*  if  the  columns  of  A  are  linearly 


independent. 


independent. 


(R3.  2)  A"*"  =  A*  (A  A*)  ^  if  the  rows  of  A  are  linearly 


(R3.  3)  A"*"  =  A’^  if  A  is  square  and  nonsingular. 

Theorem  2:  The  matrix  correspondence  A  -•  A'*’  satisfies  the  following 


(Rl) 

(A+)+  =  A 

(R2) 

(A  V  =  (A+)  = 

(R3) 

+  >!<  j'.c 

A^AA  =A 

(R4) 

>:<+>!< 

A  AA  =  A 

(R5) 

AA  A  =  A 

(R6) 

A  A  A  =  A 

(R7) 

A*+A*A  =  A 

=  A 
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Then 


(AB)  + 


Note  that  C  satisfies  properties  (1),  (2),  (3)  but  not  (4)  for  (AB)'*’  listed  in 
Theorem  1 . 


Remark;  Let 


Then 


A+  = 


Ao.i 

0.  1 

Vo.  2 

0.  2 

(0.1)  A* 


The  eigenvalues  of  A  are  3  and  0.  The  eigenvalues  of  A^  are  clearly  0.  3 
and  0.  The  lack  of  any  "inverse"  relation  between  3  and  0.3  shows  that 
(Rl6)  of  Theorem  2  needs  some  hypothesis.  More  generally,  let  A  be  the 
singular  2x2  matrix  (a 8  /  -1). 


A 


;  then  (1  +  a8)  ^ 


the  notation  meaning  A  has  the  eigenvalue  1  +  a8  corresponding  to  the  eigen¬ 
vector 


+  +  2-12-1* 
etc.  Now  A  is  readily  computed  to  be  A  =  (1  +  8  )  (1  +ci  )  A  .  The  non¬ 

zero  eigenvalue  (1  +  a8)  of  A  thus  corresponds  to  the  non-zero  eigenvalue 
(1  +  8^)"^  (1  +  a^)"^  (1  +  a8)  of  A'''.  The  latter  is  (1  +  a8)’^  if  and  only  if 
a  =  8i  which  is  true  if  and  only  if  AA^  =  A^  A. 
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Examples: 

(El)  Et.  =  E.. 

(E2)  If  A  =  A*  =  A^,  then  A'''  =  A 


Definition:  For  A(mxn)  and  b  (m  x  1) ,  ^  (A,  b)  =  A^b  +  (Ij^  -  A+ A) 

Lemma  1:  The  element  of  least  norm  in  U  (A.b)  is  A+b. 

Proof:  Let  x€M(A,  b).  Then  x  =  A^b  +  (I  -  A"*^  A)  v  for  some  v  G  £^,  and 
||  x|P  =  l|  A'*'b||^  +  11(1  -  a”*"  A)  v||^  since  (I  -  A'*'A)  v  and  A'^b  are- orthogonal; 

[(I  -  A'^'A)  v]*  (A'''b)  =  v"'(r- A'^A)*  A+b  =  v*  (I  -  A+ A)  A'''b  =  0.  Thus 
i|x|!;  >  II A'*'b II  unless  || (I  -  A”*"  A)  v||  =  0,  i.  e.  ,  ||x]|  >  ||A'^b||  unless  x  =  A^b, 
q.  e.  d. 

Lemma  2:  Let  xg£^.  Then  x€?/(A,  b)  if  and  only  if  A(x- A'*^b)  =  0. 

Proof:  Suppose  A  (x  -  A'^b)  =  0.  Let  x  =  A'*^b  +  y.  Then  Ay  =  0  so  that 
y  =  A"'’  Ay  +  (I  -  A"*"  A)  y  =  (I  -  A"*”  A)  y,  x  =  A'*'b  +  (I  -  A'*'  A)  y  G  M(A.  b) .  The 
converse  is  trivial,  q.  e.  d. 


Lemma  3:  Let  xGG^.  Then  x  tp-(A,h)  if  and  only  if  A  (A  x  -  b)  =  0. 

Proof:  We  shall  use  Lemma  2  in  the  proof.  We  must  show  that  A  (x  -  A^b) 

'  ^  5j<  X 

=  0  if  and  only  if  A  (Ax-b)  =  0.  Suppose  that  A  (x  -  A  b)  =0.  Then  0  =  A  A 

X  i\i  5je  iffi 

(x  -  A  b)  =  A  Ax -A  b  using  (R4)  of  Theorem  2.  Conversely,  let  A  Ax  =  A  b. 
Then  A  ^A  Ax  =  A'^A  b  =  Ax  =  A A^b  using  (R7)  of  Theorem  2  and  the  fact 
that  (AA^)  '  =  AA^  =  A^^A‘,  q.e.  d. 


Lemma  4:  Let  A(mxn),  N(nxn)  nonsingular,  M(mxm)  nonsingular. 
Then  (AN)  (AN)'*'  =  AA"^  and  (MA)"*"  (MA)  =  A. 
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Proof:  Let  L 


(AN)  (AN)"''.  R  s  aa'*'.  Then  \}  =  L  =  L*  and 

-1  A+x 


=  R  =  r",  LR  =  (AN)  (AN)'^  AA"^  =  (AN)  (AN)+  (AN)  (N’^A"^)  =  ANN'^  A'*’  = 
AA'*'  =  R,  (LR)"‘  =  r'"  =  R  =  r'^L*  =  RL.  Let  B  =  AN.  Then  RL  =  (BN'^) 


(BN"^)‘''BB''' 


(BN’S  (BN’S''’  (BN’S  NB'^ 


BN’^  NB"''  =  BB'^ 


R  =  R  L  =  L,  (AN)  (AN)"’’  =  . 


Thus 


(A'M  )'^  =  a'  a 


Using  this  identity  we  see  that  (AM) 


Transposing  we  get  (MA)^  (MA)  =  A^  A,  q.  e.d. 


Lemma  5:  Let  A(mxn),  and  N(nxn)  nonsingular.  Then  (A,  b)  = 
NW'(AN,b). 

Proof:  Let  x  =  A'*'b  +  (I  -  A^A)  v  e?i/(A,  b).  Then  AN  [N’^x  -  (AN)^b]  = 
Ax  -  (AN)  (AN)^h  =  AA'^^b  -  AA^b  =  0,  N  ^  x  e/2/(AN,b)  using  Lemma  2, 

N  ^  V(A,  b)  <  ^(AN,  b),  /l/-(A,b)  <  N?i/(AN,  b).  Using  this  identity  we  see  that 
?AAN,b)  <  N’^A^XANN’Sb)  =  N’^A‘(A,b),  N?/-(AN,b)  <  ?/(A,b.).  and  so 
MA,b)  =  NW-(AN,b). 

2 

Lemma  6:  Let  V  =  S  be  positive  definite  (nxn),  A(mxn),  b(mxl). 

Then  the  vector  (nxl)x  of  least  |ix||.^  in  y-{A,h)  is  given  by  S’^(AS’S^b- 

- 1  - 1  +  + 

Proof:  Let  u  =  S  (AS  )  b.  Then  Au  =  AA  b  using  Lemma  4, 

A  (u  -  A^b)  =0,  u  €  >^(A,  b)  using  Lemma  2.  Now  )/(A,  b)  =  S’^  ;k^(A,S’Sb) 
using  Lemma  5.  Let  x  S  y-{A,h) .  Then  x  =S’^y  where  ye^(A,  S’  S  b). 

Now  llyll  s:  |l(AS'S''’b||  by  Lemma  1.  Thus  ||x||.^  =  ||Sx|l  =  ||y||  £  ||(A,  S’ S''’b|l  = 
||s  ^  (A,  S  S^b||^  =  l|u||.^  and  so  u  is  the  element  of  least  "V"  norm  in 
WA,b)., 


Lemma  7:  The  following  statements  are  equivalent: 

(1)  The  columns  of  A  are  linearly  independent. 

(2)  A'*  A  is  non  singular. 

(3)  A‘*'A  =  1. 

Proof:  (1)  -•  (2):  Let  A*Ax  =  0.  Then  0  =  x*^A*Ax  =  ||Ax||^,  0  =  Ax, 

X  =  0.  We  now  show  (2)  “*  (3).  Let  u  =  (A^  A  -  I)  y.  Then  a''au  =  0,  u  =  0, 
and  (A^  A  -  I)  y  =  0  for  all  y,  A^  A  =1.  Lastly,  we  prove  (3)  "*  (1).  Let 
Ax  =  0.  Then  A^  Ax  =  0=x,  q.e.d. 
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Theorem  3;  The  eq^-ajation  Ax  =  b  has  a  solution  (vector)  x  if  and  only 
if  AA^b  =  b.  If  the  lattes r  equality  holds  then  x  is  a  solution  if  and  only  if 
xeVCAib). 

Proof:  If  AA^b  =  b  then  clearly  A^b  is  a  solution.  Conversely  suppose 
Ax  =  b  for  some  xCfi^.  Then  AA^b  =  AA'^Ax  =  Ax  =  b.  This  proves  the 
first  statement.  To  pro -^e  the  second  statement,  let  A  A^b  =  b.  Then  x  is  a 
solution  of  Ax  =  b  if  anc3  only  ifO=Ax-b=Ax  -  AA^b  =  A(x  -  A^b).  By 
Lemma  2  this  is  true  if  ^nd  only  if  x  e  y-(A,  b). 

Theorem  4:  (Least  Squares)-For  A(mxn)  and  b(mxl),  the  set  of  all 
(nxl)  vectors  x  such  th_at  ||Ax-b||  is  a  minimum,  is  jl^(A,b).  Also,  the 
nxl  matrix  (vector)  of  Iteast  norm  such  that  ||b  -  Ax||  is  minimized,  is  A'*^b. 

Corollary  1:  For  A_  (mxn)  and  b(mxl),  and  W(mxm)  =  R  R  which  is 
positive  definite,  the  set  of  all  (nxl)  vectors  such  that  ||Ax-b||^  is  a  mini¬ 
mum,  is  ^/•(RA,  Rb).  TFhe  vector  of  least  norm  such  that  Ax-b  is 
^  "  "  w 

minimized,  is  (RA)  Rb  « 

2  2 
Corollary  2:  Let  V  =S  Toe  positive  definite  (nxn),  W  =  R  positive 

definite  (mxm),  A(mxr:a-),  b(mxl).  Then  the  set  of  all  (nxl)  vectors  such 

that  ||Ax-b||^  is  a  minLrmum,  is  j|^(RA,  Rb). 

The  vector  of  least  •  *¥"  norm  such  that  Ax  -  b  is  minimized  is 

11  w 

S‘^  (RAS*^‘’'Rb. 

Proof;  Let  x  €  G  .  "Then 
- : -  n 

||b-Axl|  ^=  ||a  A'*"(b  -  A  x)|P  +  j|  (I  -  A  A'*’)  (b  -  A  x)  ||:^  = 

||AA'''(b-A  x)||:^+  ||(I  -  A  A"'')  b  ||  ^  = 

■l|A(A+b-x>.  |P  +  11(1- AA+)bi|^ 

Thus  X  minimizes  ||b  -  A^x||  if  and  only  if  A  (A^b  -  x)  =  0  which  is  true  if  and 
only  if  X  e  ^(A,  h)  using  JXemma  2.  This  proves  the  first  statement  of  the 
theorem.  The  second  foUlows  immediately  from  Lemma  1.  The  proof  of  the 


r 
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Corollary  1  follows  directly  from  thei  identity  ||Ax-bi|^  =  ||R(Ax-b)||.  The 
first  statement  of  Corollary  2  is  a  restatement  of  part  of  Corollary  1.  The 
last  statement  of  Corollary  2  is  an  immediate  consequence  of  Lemma  6. 

Theorem  5:  Let  A  be  mxn  and  z  be  any  mxl  matrix  (vector).  Then 
there  exists  mxl  matrices  (vectors)  x  and  y  such  that 


(1)  z  =  X  +  y 

(2)  X  is  in  the  column  space  of  A 

(3)  y  is  orthogonal  to  the  column  space  of  A 

Any  vectors  satisfying  (1)  -  (3)  above  are  unique,  and 

(4)  X  =  A  A'*'  z 

(5)  y  =  z  -  A  A"*"  z 

(6)  x*y  =  0 

Thus  A  A"*"  is  the  projection  which  takes  any  column  vector  (mx  1)  into  the 
column  space  of  A;  -  AA"*"  is  the  projection  which  takes  any  (m)  vector  into 
the  orthogonal  complement  of  the  column  space  of  A.  The  proof  of  this 
theorem  follows  directly  from  (4),  (5),  and  (6)  above. 

II.  THE  GAUSS-MARKOFF  THEOREM 


J/ will  designate  a  vector  space  of  real  valued  random  variables  over  the 
.real  numbers  R  .  There  is  assumed  to  be  a  linear  functional  called  "expected 
value"  on  V  to  R  ,  and  written  v  (the  "expected  value"  of  v)  for  any  ve^. 
Thus  for  a,  b  e  and  x,  y  €  y  we  have  '5  =  a  and  a  x  +  b  y  =  ax  +  by.  Capital 
English  letters  will  designate  matrices.  Let  A  =  (^^j)-  Then  the  terminology 
"A  constant"  below  will  be  used  to  indicate  that  the  a.,  are  in  R  ;  "A  variable" 
will  indicate  that  the  a^j  are  in  Y .  In  general,  capital  English  letters  near 
the  beginning  of  the  alphabet  will  be  used  for  constant  matrices  and  letters 
near  the  end  of  the  alphabet  will  be  used  for  variable  matrices.  The  expected 
value  functional  on  Y  to  R  may  be  extended  to  a  linear  function  on  variable 
matrices  to  constant  matrices  by  defining  V  =  (v. .)  for  any  variable  V  =  (v. .). 
The  notation  will  designate  the  set  of  all  variable  (nxl)  matrices. 
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Elements  of  1/ ^  will  also  be  called  n  dimensional  variable  vectors,  or  jtist 
variable  vectors,  if  the  dimension  is  clear  from  the  context. 

Lemma  8:  Let  A,  B  be  constant  and  V  variable.  Then 

(1)  (Y)*  =  (V*) 

(2)  AVB  =  AVB  if  the  multiplication  is  defined. 

Proof:  Follows  directly  from  the  definitions  and  linearity  of  the  "expected 
value"  operator. 

Definition:  Let  v  be  a  variable  vector.  Then  $  (v)  =  (v  -  v)  (v  - 
and  (v)  is  called  the  variance -covariance  matrix  of  v. 

Lemma  9:  Let  A  be  constant  and  v  be  a  variable  vector.  Then^ 

$  (Av)  =  A  t  (vj  A*. 


Let  (H)  designate  the  hypothesis  y  is  an  (mxl)  variable 
vector,  X  is  an  (nxl)  constant  vector,  A(mxn)  is  constant,  y  =  Ax,  and 
^(y)  =  W  =  is  positive  definite,  R  =  R'.  Let  i]!  =  c*''x,  for  some  constant 

c(nxl).  We  shall  call  an  estimate  T  of  t  linear  if  t  =  g^y  for  some  con¬ 

stant  g(mxl),  and  unbiased  if  =  f. 

Theorem  6  (Gauss -Markoff):  Let  i|(  be  as  above  and  assume  (H).  Then 
'll  has  an  unbiased  linear  estimate  if  and  only  if  A"*"  Ac  =  c.  In  the  latter  case 
c  (R  A)  R  y=  is  an  unbiased  linear  estimate  of  f  having  minimal 

variance  c'''(A'’'W  ^  A)+  c  in  the  class  of  all  unbiased  linear  estimates  of  ^  . 

If  A"*”  A  =  I,  then  =  c  (A*  W  ^A)  ^A  W  ^y  and  the  variance  of  is 

c  (A  W  A)  c. 

Proof:  Suppose  ilf  has  the  unbiased  linear  estimate  i)i  =  g  y.  Then 
C  X  =  f  =  111  =  g'''y  =  g*Ax  for  all  x,  c  =  g*A,  c  =  A*g,  A+Ac  =  A''’AA*g  =  A  g 

c.  Conversely  suppose  A'^'Ac  =  c.  Then  c'^A"'"  A  =  c*  and  so  using  Lemma 

4,  we  have  Tj^y.=  c'''(R"^  A)  +  R‘^  y  =  c*{R"^A)‘''(R'^A)x  =  c*A'''Ax  =  c’"  =  iIi  so 
f  j^y  is  clearly  an  unbiased  linear  estimate.  This  proves  the  first  statement 


This  may  be  proved  by  direct  computation 

Consult  Theorem  2  for  the  appropriate  identity  used. 
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of  the  theorem.  By  the  first  part  of  the  proof,  any  unbiased  linear  estimate 


?  =  g*  y 


must  satisfy  the  property  A*g  =  c.  Also  $(i|()  =g*t(y)  g  =g*R^g. 

Thus  the  unbiased  linear  estimate  of  minimaT  variance  is  given  by  the  sdlu- 

tion  A’  g  =  c  such  that  ||g||  is  minimized.  The  latter  solution  is  given  by 

g^  =  R  (A"  R  )''’c,  which  yields  as  the  estimate  sought.  Also 

=:<  2  ..+  „iT„2„-l  ,  _ 


A)'^R  ‘R  R 


(A*R-1)^  ctt'=  c*(A*R‘^A)^  c 


(A  ■  R 

4:  S!t  _  1  J. 

c  (A  W  ^A)^c. 


)"  c  =  c  '  (R  "  A) 

If  A'*’ A  =  I  then  the 
-1 


columns  of  A  are  linearly  independent.  Then  the  columns  of  R  A  are 
linearly  independent  and  =  c  (R  ^  A)^  R  ^  y  =  c  ’  [(R  ^  A)  (R  ^  A)  ] 

(R  A)  R  y  =  c  (A  W  A)'1a  W  y,  q.e.d. 


Suppose  we  assume  (H)  and  define  =  (R  ^  A)^R  ^  y.  Then 

A)  +  using  the  same  argument  as  in  the  proof  of  the  above 

theorem.  Note  that  now  enjoys  the  following  properties: 

!;=  ■  .  ,  'i'  .1 

(1)  If  t  -  c  X  has  an  unbiased  linear  estimate,  then  c  is  an 

unbiased  linear  estimate  of  t  having  minimal  variance  c*$(Xj^y)  c  in  the 

class  of  all  unbiased  linear  estimates.  This  follows  directly  from  the 

theorem  above. 


(2)  The  vector  x, is  the  vector  of  least  norm  such  that  S(W)  = 

...  V  M  V 

(y-Ax)’''w”  (y-Ax)  =  ||y-  Ax||.^_2  is  minimized.  This  follows  directly 
from  the  (least  Squares)  Theorem!'  If  for  an  arbitrary  positive  definite 
matrix  W,  x  _  (W)  is  defined  to  be  the  vector  of  least  norm  which  minimizes 
S(W)  ,  then  (W)  is  called  the  generalized  weighted  least  square  s  e stimate 

corresponding  to  the  weighting  matrix  W.  Thus  is  Xj^g($(y)). 


T 

'See  Theorem  4,  Corollary  2 
+  + 

Consult  Theorem  2  for  the  appropriate  identity  used 

•'•  +  + 

'See  Theorem  4,  Corollary  1 
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III.  PARAMETER  ESTIMATION  FOR  ORBIT  DETERMINATION 

Theorem  7:  Let  vj^  and  be  two  independent  variable  vectors  in  "Y ^ 
such  that  Vj^  =  v^  a-nd  =  $  (v^)  is  positive  definite  (i  =  1,2).  Let  v  = 

+  V2^)‘Mv"^  v^  +  V2^  v^).  Then  V  is  unbiased  (i.  e.  ,  w--v^=v^  and 
V  has  minimal  variance  covariance  matrix  +  V2^)  ^  in  the  unbiased 

class  of  vectors  +  (I  -  R)  V2  for  R  constant  (nxn). 

Proof;  The  fact  that  $  (v)  =  (Vj^  +  Y ^  )  follows  by  direct  computation. 
Let  y  -  Rvj  +  S  V2  where  R  Is  constant  (nxn)  and  S  =  I  -  R.  Then  clearly  y 
is  unbiased.  We  wish  to  show  that  $  (y)  ^  Jlv)  ,  which  is  defined  to  mean 
A  =  $  (y)  -  i(v)  is  positive  semi -definite.  By  direct  calculation  A  =  RV,  R  + 

1  ^  t  * 


+  V2V^ 

-1  ^ 


SV2S'  -  (V, 

B  - 

[V2S 

where  C  (V 
DQ^D*  -  (p2  +  O'")  where 
(CP-P)  (CP- P)*  +  (D  O -Q)  (DQ-Q)*. 


Then  A  =  (V"^  +  V"^)‘^  B(V‘^  +  V"^"^  where 
,-l  r,,  .  r,,.--!  .  n  .,r-l 


[(Vj  +  V‘^  RVj  ]  V‘^  [Vj  R  (V-^  +  V'^)]  +  [(V‘^  +  V'M  SV2]  v‘ 
+  v:^)]  -  (v:^  +  v:^),  b  =  cv:^  c*  +  dv:^  d*  -  (v‘^  +  v:*^ 

i  1^-1^  ^  1  \  ^  ^  ^ 

tV^'lRVj,  D.(V,1  tV£l)  sv 

X  o2^  =  p2  p  _  p* 


Thus  B  =  C  P^  C*  + 


=  Q  ,  Q  =  Q 


Let  F 


(DO-Q)  (QD*-Q)  =  CP^C*  -  CP^-P^C 

,2  _ —,2  .  _„2_'I‘  ,  r/-r-.2  .  ^2,  ,^.,.,2 


r-1 


Then  F  =  (C  P  -  P)  (P  C 

D 
2 


2  2  >J 

+  P^  +  DQ^D 


P)  + 
2  2  >!< 
-DQ^-Q^D  4 

*  _  2  _ 


0“  =  CP‘'C'" +DQ  D'  +  [(P'^  +  Q1  -  (CP  +  DO  )  -  (P'^C  +  D  )  ]  .  But 


P^  +  and 
CP^C*  + 


C  P^  +DQ^  =  C  +  DV'^  =  (V‘^  +^2^  (R  +  S)  =  V'^  +  V'^ 

by  transposing  both  sides,  P^  C  '  +  D'^  =  P^  +  Q^.  Thus  F 

2  2  2 

DQ  D"  -  (P  +  Q  )  =  B  and  B  is  positive  semi-definite,  since  B  =  F,  which 
is  the  sum  of  two  terms  of  the  form  MM  and  hence  positive  semi  -  definite . 
Then  A  is  also  positive  semi-definite  since  A  is  of  the  form  MBM  ,  q.  e.  d. 

We  shall  now  discuss  applications  of  the  preceding  material  to  parameter 
estimation  in  orbit  determination  work.  Let  f  (a,  p)  be  a  vector  (nxl)  valued 
function  of  the  vector  variables  a(rxl)  and  p(sxl);  f  (a,  p)  is  the  vector  of 
observations  without  noise  given  the  (vector)  parameters  a  and  p.  Here  a  is 
some  parameter  vector  we  wish  to  estimate  and  determine  the  variance  co- 
variance  matrix  of  the  estimate;  p  is  some  parameter  vector  which  we  do  not 


The  general  notation,  necessary  background  and  essential  formulae  parallel 
the  development  in  Reference  5. 


r 
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wish  to  estimate,  but  wish  to  account  for  the  effect  of  an  uncertainty  in  p  on 

the  variance -covariance  of  the  estimate  of  a.  We  shall  assume  that  a^  and 

p^  are  the  "true"  estimates  of  a  and  p,  respectively.  Assume  that  f  (a,  p)  is 

linear  in  the  neighborhood  of  (a^,  p^)  so  that  f  (a,  p)  =  f  (a^,  p^)  +  A(a  -  a^)  + 

P(p-  p  )  provided  a  and  p  are  sufficiently  close  to  a  and  p  ,  respectively, 

and  A,  P  are  constant  nxr  and  nxs  matrices.  Let  R  and  e  be  (nxl) 

vector  variables  satisfying  R  =  f  (a^,  p^)  +  e,  with  "6  =  0.  Note  that  R  is  the 

vector  of  observations  with  noise;  the  noise  is  assumed  to  have  mean  zero. 

Let  c  be  a  constant  (rxl)  vector.  Then  R  -  f  (c,  p^)  =  f  (a^,  p^)  -  f  (c,  p^)  +  e  =  A 

(a^  -  c)  +  e,  ll  -  f  (c,  p^  =  A(a^  -  c)  and  the  generalized  weighted  least  squares 

estimate  for  a  -c  would  be  given  by  (a  -  c)  ^  =  M  (R  -  f  (c ,  p  ))  =  M  (A  (a.  -  c)  +  e) 

where  M  =  (A  W  A)  A  W"^  for  the  weighting  matrix  W."^  Since  MA  =  I  we 

have  (a  )  T  ^  ~  c  =a-c  +  Me,  and  so  (a  o  “  ^4-  =  ^  Now  suppose  a  is 

a  (rxl)  vector  variable  such  that  a^  =  a^  and  a^  -  a^  is  independent  of  e. 

Then  a  -  a  is  independent  of  (a  )  -  a  and  the  minimal  variance  unbiased 

O  fc  V  I  fc 

linear  combination  of  the  two  is  given  by: 


=  K(V(a^)j^g  +  Ua^)  where  V  =  ^  =  ^(^0^ 


-1 


and 


K  s  (U  +  V)'^  Then  a^  =  K  [v  [c  +  M  (R  -  f  (c,  p^  ]  +  U  a^j 

Now  V  =  (M$(e)M*)'^  and  if  we  assume  c  =  a^  then  a^  =  a^  +  K V M  (R  -  f(a^,  p^)) . 
Define  a^  by 

a^  =  a^  +  KVM(R-f(a^,p^)) 

=  a^  +  KVM(R  -f  (a^,p^)  +  f  (a^,  p^)  -  f(a^,p^)) 

=  a^  +  K V  M  (€  +  A  (a^  -  a^)  +  P  (p^,  -  p^)) 


We  have  assumed  A  A  is  nonsingular,  which  is  equivalent  to  A  A  =  I.  See 
Lemma  7. 


o 
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=  K(V  +  U)  (a^  -  a^)  +  K{VMe  +  V  (a^  -  a  J  +  VMP(p^-p^)) 

=  K  [  V  M  e  -  U  (a^,-  a^)  +  V  M  P  (p^  -  p^)  ] 

4(KVMe)  =  :KVMt(e)  M*VK  =  KVK,  V  =  (Mf  (e)  M*)"^  = 

[(a"w'^  a)'^  a*w"^  ${e)  w'^  a(A''‘w‘^  a)"^]"\ 

t(a^-a^)  =  K  [V  +  VMP|:(pj.-p^)  P*M*''v  +  t(a^)'^  ]  K. 

J(a^-a^)  =  K+KVMP$(Pj. -p^)  P*M*VK. 

Note  that  if  W  =  |:(€)  then  V  =  A*W‘^  A,  V  M  =  A*W■^  and  ^(a^-a^)  =  K  + 
KA*W‘^  P$(p^  -  pJP'W"^  AK. 
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